Phenotypic and genomic signatures of interspecies cooperation and conflict in naturally occurring isolates of a model plant symbiont

Given the need to predict the outcomes of (co)evolution in host-associated microbiomes, whether microbial and host fitnesses tend to trade-off, generating conflict, remains a pressing question. Examining the relationships between host and microbe fitness proxies at both the phenotypic and genomic levels can illuminate the mechanisms underlying interspecies cooperation and conflict. We examined naturally occurring genetic variation in 191 strains of the model microbial symbiont Sinorhizobium meliloti, paired with each of two host Medicago truncatula genotypes in single- or multi-strain experiments to determine how multiple proxies of microbial and host fitness were related to one another and test key predictions about mutualism evolution at the genomic scale, while also addressing the challenge of measuring microbial fitness. We found little evidence for interspecies fitness conflict; loci tended to have concordant effects on both microbe and host fitnesses, even in environments with multiple co-occurring strains. Our results emphasize the importance of quantifying microbial relative fitness for understanding microbiome evolution and thus harnessing microbiomes to improve host fitness. Additionally, we find that mutualistic coevolution between hosts and microbes acts to maintain, rather than erode, genetic diversity, potentially explaining why variation in mutualism traits persists in nature.


Introduction
debated [8][9][10][11][12][13]. Many models of mutualism rely on the key assumption that cooperation is inherently costly and thus that selection should favour less-cooperative, potentially 'cheating', partners that forgo paying the costs while continuing to receive benefits [14][15][16][17]. At the phenotypic level, cheaters would be seen as symbiont genotypes that gain fitness at the host's expense (i.e. points in the bottom-right quadrant of figure 1a), while fitness conflict would be seen as an overall negative correlation (grey line in figure 1a). By contrast, cooperators would be seen as symbiont genotypes whose increase in fitness is associated with an increase in host fitness (i.e. points in the top right quadrant of figure 1a), while fitness alignment would be seen as an overall positive correlation (orange line in figure 1a). Evidence for fitness conflict within mutualism is mixed: although several studies have identified symbiont genotypes that gain fitness at their host's expense (e.g. [18,19]), recent experimental evolution studies have instead found that microbial adaptation to particular host genotypes is associated with an increase, rather than a decrease, in host fitness [20][21][22]. Moreover, fitness alignment at the phenotypic level does not necessarily preclude fitness conflict at the genomic level: rather than dichotomous categories of 'cooperator' or 'cheater', mutualist genomes are best viewed as mosaics of loci [5], some underlying cooperation while others underlie conflict. Whether coevolution resulting from symbiosis leads to more beneficial interactions and greater mutualism stability, or alternatively, more antagonism and less stable interactions, therefore, requires examining the relationships between host and symbiont fitness proxies at both the phenotypic and genomic levels.
Genome-wide association studies (GWAS) can be used to reveal the genes, as well as specific segregating mutations (i.e. variants), that underlie variation in symbiotic extended phenotypes in natural populations [20,[23][24][25][26][27]. Because they provide an estimate of both the strength and direction of effects of particular alleles on the trait, henceforth referred to as allelic effect sizes, GWAS are especially useful for identifying loci underlying symbiotic pleiotropy, and thus defining the mutational landscape of mutualism evolution in nature. For example, if symbiotic pleiotropy is extensive and its effects on fitness-related traits in the interacting partners tend to be discordant (grey quadrants in figure 1b), then conflict should underlie the evolution of mutualism, allowing the possibility of cheating individuals that are competitively superior, as mutualism theory predicts [14][15][16][17]. By contrast, if pleiotropic effects are overwhelmingly concordant (orange quadrants in figure 1b), fitness alignment rather than conflict should be the null hypothesis in mutualism, and models relying on cheating genotypes as the main driver of mutualism evolution may not be suitable. Because GWAS have the power to reveal individual allelic effects, they can be used to identify loci underlying fundamental sources of conflict at the genomic level (e.g. grey quadrants in figure 1b) even if fitness alignment is realized at the phenotypic level (e.g. orange line in figure 1a).
A long-standing mutualism paradox is that host-driven selection for the 'best' symbiont genotype should reduce overall symbiont diversity, yet diverse symbiont populations persist in nature (reviewed by Heath & Stinchcombe [28] and Stoy et al. [29]). Identifying patterns of selection acting on loci that determine fitness outcomes in natural populations could be key for resolving this paradox. Studies to date examining patterns of molecular variation have found stabilizing or purifying selection acting on candidate genes associated with partner recognition or quality, rather than patterns suggesting rapid turnover of alleles underlying conflict [30][31][32][33]. In a recent in silico GWAS [5], conflict over mutualistic trait optima between hosts and microbes tended to increase genetic variance due to repeated sweeps, while alignment scenarios resulted in stabilizing selection and decreased genetic variance as trait optima were reached [5]. Whether the loci associated with symbiotic pleiotropy identified by GWAS in natural symbiont populations show signatures of purifying selection, consistent with fitness conflict, or stabilizing selection, consistent with fitness alignment, remains to be tested.
Over the past approximately 25 years, legume-rhizobium symbioses have been developed as models for understanding mutualism evolution [34][35][36][37]. This interaction is one of the most ecologically and economically important symbioses, contributing upwards of 20 million tonnes of nitrogen (N) to the global N-cycle, and saving billions of dollars that would have otherwise been spent on synthetic N fertilizer production [38]. Legumes house rhizobia within specialized root nodules and supply them with photosynthate, while rhizobia within the nodules convert atmospheric N to a plant-usable form, resulting in a beneficial exchange of resources. Key traits of this symbiosis, such as plant biomass and nodule number, are known to be influenced by variants in the genomes of both host and microbes, as well as the epistatic interactions between them (genotype-by-genotype or G × G interactions; e.g. [1,26,34,[39][40][41][42]), making this symbiosis an excellent model for understanding mutualistic coevolution.
A quantitative comparison of multiple proxies of rhizobium fitness, as well as their correlations with plant fitness, is needed to reveal interspecies conflict. Seed number, or close correlates such as aboveground (shoot) biomass and leaf chlorophyll A content, are well-established proxies for reproductive fitness in annual plants [23,43,44] (figure 1c). By contrast, estimating rhizobium fitness has been an empirical and conceptual challenge [8][9][10]45,46]. Early attempts to estimate rhizobium fitness relied on single-strain experiments, whereby plants were inoculated with a single rhizobium strain, and fitness proxies including nodule number and nodule size were measured (figure 1d). Both measures reflect rhizobium fitness because a rhizobium that establishes a nodule in symbiosis can gain a fitness benefit of the order of 10 5 -10 7 [47], larger nodules release more rhizobia [43,46,48], and, intuitively, strains that produce more nodules will release more rhizobia [1,37,48]. While rhizobium fitness proxies measured in single-strain experiments can be directly correlated with host benefit (e.g. shoot biomass), these measures have been criticized for producing spurious positive correlations due to stronger fitness feedbacks between host plants and rhizobia [9]. By contrast, multi-strain experiments decouple individual strain fitness from host growth and better reflect rhizobia fitness in natural and agricultural soils, where many strains coexist. Recent advances merging population genomics and metagenomics have enabled measuring rhizobium relative fitness, i.e. a strain's ability to compete for nodulation opportunities and extract host resources once in nodules when other strains are present [25,49]. Multi-strain experiments that have estimated rhizobium relative fitness have so far hinted at a surprising lack of correlation between single-strain and multi-strain measures of rhizobium fitness [25], motivating our comprehensive analysis here.
Here, we combine two datasets: first, GWAS that test for the association between rhizobium variants and both plant and rhizobium fitness proxies measured in singlestrain experiments (figure 1c), whereby 191 strains of Sinorhizobiumm eliloti collected from natural populations were inoculated individually onto one of two genotypes of the host plant Medicago truncatula. Second, a GWAS based on a new dataset that measured rhizobium relative fitness in a multi-strain experiment (figure 1d ), whereby 89 of the 191 S. meliloti strains were inoculated together onto the same two host genotypes. We combine these datasets to first ask: What are the relationships among rhizobium fitness proxies measured in both single-strain (nodule number, nodule size) and multi-strain (rhizobium relative fitness) experiments? Are these proxies genetically distinct and thus likely to evolve independently, or are they linked through pleiotropy, and thus, evolve together? We use this information to next address the potential for genomic conflict in this symbiosis, asking: Do variants tend to have aligned or conflicting effects on host and symbiont fitnesses? Finally, we ask whether there is any evidence that historical selection has differentially shaped loci associated with fitness alignment versus conflict, as we might predict under different models of mutualism evolution.

Material and methods (a) Study system
Full details are provided in Riley et al. [50] and electronic supplementary material, Methods. Sinorhizobium meliloti is an alphaproteobacterium that undergoes symbiosis with various Medicago spp., fixing atmospheric N in return for plant photosynthate. All 191 strains used here were collected from the native range of the host M. truncatula, spanning Spain to France (as detailed in [50]). Sinorhizobium meliloti has a multipartite genome, including a large (approx. 3.7 Mbp) chromosome and two symbiosis plasmids ( pSymA and pSymB, approx. 1.4 Mbp and approx. 1.7 Mbp, respectively); pSymA contains many of the canonical genes for nodulation and N-fixation [51,52]. We used two lines of M. truncatula DZA 315.16 (hereafter DZA) and A17 in separate single-strain experiments and a multi-strain experiment detailed below. and were singly inoculated with one of 191 strains of S. meliloti described above. Within each experiment, we measured proxies for both plant fitness (shoot biomass, leaf chlorophyll A content) and rhizobium fitness (nodule number per nodule weight; figure 1c,d ).

(c) Multi-strain experiment
We grew A17 and DZA hosts with a multi-strain inoculum composed of 89 of the 191 strains used in the single-strain experiments (described in more detail in electronic supplementary material, Methods). Using the 'select-and-resequence' approach [25], this experiment allowed us to generate new data on rhizobium relative fitness, which represents a strain's ability to both compete for nodulation opportunities and extract host resources once in nodules when 88 other strains were present (figure 1d ). This fitness proxy was obtained by sequencing pooled nodule samples from each plant at the end of the experiment and estimating each strain's frequency using a haplotype reconstruction method [53]. We then obtained each strain's relative fitness by calculating the fold-change in the frequency of a strain at the end of the experiment relative to its mean frequency at the beginning (see electronic supplementary material, Methods). Because we wanted our rhizobium relative fitness metric to represent which strains will be present in the soil in subsequent generations, our method focuses on measuring strain frequencies of undifferentiated rhizobia in nodule pools; while differentiated bacteroids are responsible for fixing N, they are reproductively sterile, and thus do not contribute to the next generation. Although the plant and microbe growth conditions used here differed slightly from those used in the single-strain experiments (owing to these experiments being conducted in separate places and at separate times), previous work has shown that strain frequencies are stable in response to environmental variation within an experiment, between experiments, and across host generations [49,54]. Thus, environmental variation due to differences in growth conditions is unlikely to significantly influence how strains behave across experiments.

(f ) Genome-wide association tests
Detailed methods are provided in electronic supplementary material, Methods. We conducted linear mixed models (LMMs) implemented in GEMMA (v. 0.98.1, [57]) to quantify allelic effect sizes that represent both the strength and direction of the association between variants and fitness proxies after correcting for rhizobium population structure. We ran ten separate association analyses in GEMMA, one for each of the four fitness proxies measured in single-strain experiments (figure 1c), plus rhizobium relative fitness measured in the multi-strain experiment, and for both host plant lines (DZA, A17; 5 proxies × 2 hosts = 10 runs).

(g) Genomic analyses
We first identified pleiotropic variants as those that were significantly associated with more than one trait on the same host line, significance determined using a permutation method (described in more detail in electronic supplementary material, Methods). Variants that were significantly associated with two or more rhizobium fitness proxies were categorized as being associated with rhizobium fitness pleiotropy, whereas variants that were significantly associated with at least one rhizobium AND plant fitness proxy were categorized as being associated with symbiotic pleiotropy. We further categorized whether pleiotropic variants had discordant (opposite-sign; +,− or −,+) or concordant (same-sign; +,+ or −,−) effects on pairwise fitness proxy combinations (figure 1b

Results (a) Relationships among rhizobium fitness proxies
In single-strain experiments, when we regressed strain means for nodule number and nodule weight, we found significant negative correlations for both hosts ( figure 2a, left), indicating that strains creating larger nodules tended to form fewer total nodules on both host genotypes. At the genomic level, most variants had discordant effects ( figure 2a, right), similarly indicating a trade-off whereby variants that were positively associated with nodule weight tended to be negatively associated with nodule number, or vice-versa.
Comparing single-and multi-strain experiments, when we regressed strain means for rhizobium relative fitness and nodule number, we found a weak but significant negative relationship for host line DZA only ( figure 2b, left), indicating that strains that were more common in the multi-strain experiment formed fewer nodules in the single-strain experiment. At the genomic level on DZA, most variants had discordant effects ( figure 2b, right), again indicating a trade-off between nodule number and rhizobium relative fitness. For host line A17, most pleiotropic variants (20/28) had discordant effects (figure 2b, right), despite no relationship between these fitness proxies at the phenotypic level ( figure 2b, left).
We found a significant, positive relationship between strain means for rhizobium relative fitness and nodule weight, again for DZA only ( figure 2c, left). This result indicates that strains that were more commonly found in nodules in the multi-strain experiment formed larger nodules in the singlestrain experiment. At the genomic level, all pleiotropic variants were concordant (i.e. appearing in the top right or bottom left quadrants of figure 2c, right). For A17, while we found a lack of significant phenotypic correlation between nodule weight and rhizobium relative fitness ( figure 2c, left) the genomic level all pleiotropic variants had discordant effects on these fitness proxies ( figure 2c, right), indicating a trade-off at this scale. These results overall support trade-offs at the phenotypic level, plus underlying discordant pleiotropy at the genomic level, suggesting that strains for which plants form numerous small nodules in single-strain experiments are less able to proliferate within and compete for nodules in multi-strain experiments. We additionally found host-dependent relationships among rhizobium fitness proxies, especially at the genomic level when regressing rhizobium relative fitness and nodule weight ( figure 2c, right). However, we found fewer total pleiotropic variants associated with this relationship compared with the other two fitness proxy combinations (electronic supplementary material, figure S2A, right), suggesting that nodule weight and rhizobium relative fitness are largely governed by different molecular mechanisms and, thus, likely to evolve independently.

(b) Relationships between plant and rhizobium fitness proxies
We found no correlation, in either host, between strain means for nodule number and shoot biomass (figure 3a, left), or nodule number and leaf chlorophyll A content on A17 (electronic supplementary material, figure S3A, left). However, we found a significantly negative correlation between nodule number and chlorophyll A for DZA (electronic supplementary material, figure S3A, left). Assuming chlorophyll A is related to the N-fixation efficiency of each strain [61][62][63], this result suggests that strains forming more numerous (and smaller) nodules tended to fix less N. At the genomic level, most variants had concordant effects for nodule number and shoot biomass on DZA, whereas the opposite was true for the same proxy pair on A17 (figure 3a, right). Most variants had discordant effects on nodule number and chlorophyll A on both hosts (electronic supplementary material, figure S3A, right). Strain means for nodule weight and shoot biomass were significantly positively correlated for both hosts (figure 3b, left). Nodule weight and chlorophyll A were significantly positively correlated on DZA, but uncorrelated on A17 (electronic supplementary material, figure S3B, left). At the genomic level for both hosts, all pleiotropic variants had concordant effects for nodule weight and shoot biomass (figure 3b, right), as well as nodule weight and chlorophyll A (electronic supplementary material, figure S3B, right).
Finally, rhizobium relative fitness and shoot biomass were significantly positively correlated for DZA but not for A17 ( figure 3c, left), whereas rhizobium relative fitness and leaf chlorophyll A content were significantly positively correlated on both hosts (electronic supplementary material, figure S3C, left). At the genomic level, most pleiotropic variants had concordant effects on rhizobium relative fitness and shoot biomass for both hosts (figure 3c, right), and this pattern was even stronger between rhizobium relative fitness and chlorophyll A on DZA, with all but one pleiotropic variant having concordant effects (electronic supplementary material, figure S3C right). Variant effects were mixed on A17 for rhizobium relative fitness and leaf chlorophyll A content (electronic supplementary material, figure S3C, right).
Overall, we found mostly concordant relationships between rhizobium and plant fitnesses at both the phenotypic and genomic levels, suggesting a strong signal of fitness alignment in natural rhizobium populations. However, we note a lack of pleiotropic variants underlying the relationship between leaf chlorophyll A content and both rhizobium fitness proxies measured in single-strain experiments (i.e. nodule number and nodule weight; electronic supplementary material, figure S2C, left and middle), suggesting these proxies are governed by different molecular mechanisms, and thus should evolve independently.
(c) Selection acts differently on genes associated with alignment versus conflict Using multiple diversity and neutrality metrics, we found that rhizobium genes associated with fitness alignment exhibit higher nucleotide diversity and stronger signatures of balancing selection compared with any other gene category analysed. For three of the four test statistics, genes associated with concordant symbiotic pleiotropy (i.e. solid orange lines in figure 4) had significantly elevated values relative to the 'null' (i.e. distributions in figure 4, all genes containing significant variants identified by GWAS). We did not see any significant deviations from the null for genes associated with discordant symbiotic pleiotropy (i.e. dashed grey lines in figure 4) or for genes associated with both concordant and discordant rhizobium fitness pleiotropy (electronic supplementary material, table S1, figure S4 and Dataset S1).

(d) Loci associated with fitness alignment are hostdependent
Comparing the identities and putative functions of variants associated with symbiotic pleiotropy on both host lines revealed little overlap-concordant variants associated with an increase in both high host and symbiont fitness (i.e. fitness alignment) largely differed between the two host genotypes. Specifically, we identified a total of 168 variants associated with symbiotic pleiotropy, corresponding to 128 coding-genes (see electronic supplementary material, Dataset S2 for variant-level and electronic supplementary material, table S2 and Dataset S3 for gene-level summaries); 60 and 93 of these variants were uniquely associated with fitness proxies measured on plant lines A17 and DZA, respectively, while only 15 variants were shared between hosts. We highlight some of the noteworthy genes identified in our analysis for each pleiotropic category in electronic supplementary material, Results.

Discussion
Leveraging genomics to quantify the genetic architecture underlying symbiotic extended phenotypes gives us the power to address long-standing issues in mutualism evolution with genome-scale resolution. Our results overall suggest that: (i) fitness alignment between hosts and rhizobia is common at both the phenotypic and genomic levels, with genes associated with fitness alignment showing elevated nucleotide diversity and signatures of balancing selection; and (ii) the lack of a relationship or even trade-offs between rhizobium nodule number and rhizobium relative fitness means that measures of rhizobium fitness in multi-strain experiments should be prioritized when we want to predict rhizobium evolution. We discuss these main points in turn below.  Figure 3. Fitness alignment between rhizobia (Sinorhizobium meliloti) and host (Medicago truncatula) prevails at both the phenotypic (left panels) and genomic (right panels) levels. Phenotypic: Genetic correlations between pairwise fitness proxies measured on plant lines DZA (green, top rows) or A17 ( pink, bottom rows), based on 89 S. meliloti strains. Dots represent estimated marginal strain means for shoot biomass, nodule number and nodule weight, all being measured in singlestrain experiments, or medians for rhizobium relative fitness measured in multi-strain experiments. Numbers at the top right of each correlation represent Pearson correlation coefficients, while asterisks represent significance: *, p < 0.05; **, p < 0.01; ***, p < 0.001. Genomic: Dots represent allelic effect sizes (i.e. beta-scores calculated in GEMMA), those falling along the diagonal (orange quadrants) or off-diagonal (grey quadrants) represent variants with concordant or discordant effects, respectively. Coloured dots represent variants that were significantly associated with one of the two fitness proxies, while black dots represent pleiotropic variants, i.e. significantly associated with both fitness proxies. Numbers outside and percentages within parentheses at the top left and right of each plot represent the pleiotropic variant counts and the proportion of total significant variants, respectively, that are discordant (left, in grey) or concordant (right, in orange). In one-to-many symbioses, such as a single legume associating with a diverse population of rhizobia, less beneficial symbionts are predicted to achieve higher relative fitness compared with more beneficial counterparts [14][15][16]. While we found evidence for less beneficial rhizobia (i.e. points closer to zero along the y-axes of panels in figure 3), we found little evidence for 'cheating' rhizobia genotypes or loci associated with a gain in fitness at the host's expense (i.e. a lack of points in the bottom right quadrants of figure 3 and few discordant loci associated with symbiotic pleiotropy). Instead, we found strong fitness alignment even in environments where multiple strains occupy the same plant.
Fitness alignment is ultimately governed by the degree to which mutualistic trait optima are shared among partners [5], as well as the degree of fitness feedbacks that enforce alignment between partners [11,[14][15][16][17]. For example, legume host plants have autoregulation of nodulation to limit the formation of costly nodules once sufficient N levels are achieved [64]. No such constraint exists for rhizobia; every nodule formed is expected to lead to greater potential fitness benefits. Thus, a mismatch between the optimum nodule number for a plant versus rhizobium could generate conflict [5,65]. Indeed, the strongest evidence for conflict in our and other studies (e.g. [1,44,48]) comes from regressing plant fitness proxies on nodule number, suggesting conflict over a host's total investment in nodulation. In addition to controlling the total number of nodules formed, legumes can also allocate more carbon to nodules that fix more nitrogen [35,66,67], which acts to couple rhizobium quality and fitness even when multiple strains are present. Such fitnesscoupling mechanisms can be disrupted by mismatches among legume and rhizobium genotypes, allowing strains that fix little to no N to proliferate within nodules [19,68,69]. Our observations of abundant alignment between host fitness proxies, nodule size, and rhizobium relative fitness would predict that such mismatches may be rare in nature, although we only included two host lines and neither was likely to naturally co-occur with our rhizobia strains. Overall, our results suggest that trait optima can be shared even in one-to-many interactions, and that fitness feedbacks operating to align host and symbiont fitness can be maintained even in diverse communities irrespective of coevolutionary history.

(b) Genomic resolution of conflict and alignment in symbiosis
Rather than to identify causal loci underlying symbiotic pleiotropy, the goal of our study was to examine broader patterns of fitness alignment or conflict across the genomes of numerous naturally occurring rhizobium strains. We found abundant concordant pleiotropic variants associated with both host and symbiont fitnesses, alongside evidence that selection has acted to maintain genetic variation within rhizobium genomes through time and/or space. A lack of discordant pleiotropic variants, like we have found, may have resulted from physiological constraints that make such variants impossible (i.e. alleles associated with larger nodules and small host biomass are rare or non-existent), or because of correlational selection that removes discordant variants from the population altogether [70,71]. In addition to direction, the extent of symbiotic pleiotropy (i.e. the number of pleiotropic variants) can inform whether traits are likely to evolve together or independently. Fitness proxy pairs with abundant pleiotropy (e.g. nodule size versus shoot biomass) suggest a highly polygenic basis governed by many small-effect pleiotropic variants; other proxies with little pleiotropy (e.g. nodule number versus chlorophyll) are largely governed by different sets of genetic mechanisms and are thus likely to evolve independently. One of our more critical findings, that the loci associated with both high host and symbiont fitness benefits (i.e. fitness alignment) largely differed across host genotypes, provides one solution to the mutualism paradox: if host genotypes act as distinct selective environments for rhizobia, meaning that the 'best' symbiont genotype differs among host genotypes, then symbiont diversity could be maintained in the face of host selection. Host-dependent loci associated with fitness alignment were also found in experimentally evolved rhizobia isolates [20], and host-mediated balancing selection was previously proposed as a mechanism maintaining rhizobial diversity in native populations of Bradyrhizobium [72]. Our results of heightened nucleotide diversity and signatures of balancing selection acting on genes underlying fitness alignment additionally support the role of host-dependent selection in maintaining diverse symbiont populations in nature.
We nonetheless found several instances of fitness conflict at the genomic level despite alignment at the phenotypic level (e.g. shoot biomass versus rhizobium relative fitness on DZA). Linkage disequilibrium (LD) could lead to discordant alleles being 'packaged' into multilocus genotypes that show fitness alignment. Such LD results from multiple, non-mutually exclusive factors including epistatic interactions among individual variants that render discordant variants effectively neutral, and/or past selection favouring allelic combinations that increase both host and symbiont fitness (and disfavour discordant combinations; [73,74]).
Overall, these findings highlight the polygenic nature of symbiotic extended phenotype variation in nature-where the collective action of individual mutations, their additive and nonadditive effects [42], and a history of selection shapes the trait variation currently present in natural populations [75,76]. However, because of the limitations inherent within GWAS, including LD which makes it difficult to pinpoint causal variants as well as false positive and negative associations, we acknowledge that the function of any specific locus identified here needs to be further validated in followup experiments. Additionally, we only focus on allelic substitutions rather than presence/absence variation generated via gene gain and loss, the latter of which previous studies have found to be associated with exploitative traits in rhizobia (e.g. [67]). Whether fitness conflict is predominantly associated with genetic variation generated via gene gain/loss rather than allelic substitution remains to be tested in our dataset.
(c) Trade-offs among rhizobium fitness proxies and the rhizobium competition problem Understanding how microbial symbioses, which are ubiquitous and important in nature, evolve (or coevolve) requires accurate estimates of symbiont fitness in ecologically realistic settings. Given our evidence at both the phenotypic and genomic levels for trade-offs among rhizobium fitness proxies, and because diverse strains of rhizobia co-occur in nature, relative fitness proxies should be used whenever possible [25]. Nevertheless these proxies are not replacements for those measured in single-strain experiments because they cannot be used to assign individual genetic means for whole-plant measures of host benefit ( plant biomass and seed number) to individual strains (e.g. [1,26,39,42]), necessitating that host benefit and rhizobium fitness be measured on separate individuals. Together our results suggest that the genetic architectures associated with rhizobium fitness proxies, and their relationships, are host-dependent, and thus that their evolutionary trajectories are influenced not only by variation in the rhizobium's genome, but also by variation in host traits. For example, host genotypes could differ in their ability to exert sanctions or partner choice, quantitative traits known to vary in host populations [44,77]. At the genomic level, distinct variants underlying rhizobium fitness pleiotropy on each host genotype suggest that the genetic mechanisms (i.e. genes, pathways, metabolic processes) governing the relationship between fitness proxies are largely non-overlapping when rhizobia interact with different hosts. Such host genotype-dependent shifts in the rhizobium genetic architecture of these symbiotic extended phenotypes are supported by transcriptomics studies of G × G interactions [25,40] and GWAS revealing distinct sets of candidate genes in different host backgrounds [26,42]. Similar genetic variation exists in hosts [78] and undoubtedly interacts with the variants we identify here, and thus, should be accounted for if we want to uncover the multi-genomic basis of symbiotic extended phenotypes.